Learning Implicit User Interest Hierarchy for Web Personalization

نویسندگان

  • Hyoung-rae Kim
  • Philip K. Chan
چکیده

Learning Implicit User Interest Hierarchy for Web Personalization by Hyoung-rae Kim Dissertation Advisor: Philip K. Chan, Ph.D. Most web search engines are designed to serve all users in a general way, without considering the interests of individual users. In contrast, personalized web search engines incorporate an individual user's interests when choosing relevant web pages to return. In order to provide a more robust context for personalization, a user interest hierarchy (UIH) is presented. The UIH extracts a continuum of general to specific user interests from web pages and generates a uniquely personalized order to search results. This dissertation consists of five main parts. First, a divisive hierarchical clustering (DHC) algorithm is proposed to group words (topics) into a hierarchy where more general interests are represented by a larger set of words. Second, a variable-length phrase-finding (VPF) algorithm that finds meaningful phrases from a web page is introduced. Third, two new desirable properties that a correlation function should satisfy are proposed. These properties will help understand the general characteristics of a correlation function and help choose or devise correct correlation functions for an application domain. Fourth, methods are examined that (re)rank the results from a search engine depending on user interests based on the contents of a web page and the UIH. Fifth, previously studied implicit

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Personalized Web Search by User Interest Hierarchy

Most of the web search engines are designed to serve all users, independent of the needs of any individual user. Personalization of web search is to carry out retrieval for each user incorporating individual user’s interests. Current personalization techniques use the hierarchy of hyperlinks or map a user query to a set of categories etc. But, they do not use both the contents of a web page and...

متن کامل

Keyword Extraction Based on Implicit Feedback

To improve the results from search engines and make them more personalized for the user, we need to find out about the interests of a particular user. Many of the search personalization methods analyse documents visited by the user and from these documents infer the user’s interests. However, this approach is not accurate, because the user is rarely interested in the whole document; he might be...

متن کامل

Improving Web Personalization via User Interest Hierarchy and Scoring Techniques

The World Wide Web is rampantly growing, providing users with a vast amount of information from which to search and explore. However, the retrieval of information is rather basic in that search engines often do not make distinctions between users; most search engines provide different users with identical results when their search queries are the same. We assert that this is naive and rather sh...

متن کامل

A Web Usage Mining Framework for Web Directories Personalization

In this thesis we propose a novel framework that combines Web personalization and Web directories, which results in the concept of Community Web Directories. Community Web directories is a novel form of personalization performed on Web directories, that correspond to “segments” of the directory hierarchy, representing the interests and preferences of user communities. The proposed approach is b...

متن کامل

Personalized Web Search by Using Learned User Profiles in Re-ranking

Search engines return results mainly based on the submitted query; however, the same query could be in different contexts because individual users have different interests. To improve the relevance of search results, we propose re-ranking results based on a learned user profile. In our previous work we introduced a scoring function for re-ranking search results based on a learned User Interest ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005